Summary

Tying teachers’ pay to their classroom performance should, says Victor Lavy, improve the cur- 
rent educational system both by clarifying teaching goals and by attracting and retaining the 
most productive teachers. But implementing pay for performance poses many practical chal- 
lenges, because measuring individual teachers’ performance is difficult. 

Lavy reviews evidence on individual and school-based incentive programs implemented in re- 
cent years both in the United States and abroad. Lavy himself evaluated two carefully designed 
programs in Israel and found significant gains in student and teacher performance. He observes 
that research evidence suggests, although not conclusively, that pay-for-performance incentives 
can improve teachers’ performance, but they can also lead to unintended and undesired
consequences, such as teachers’ directing their efforts exclusively to rewarded activities. 

Lavy also offers general guidelines for designing effective programs. He emphasizes that the sys- 
tem must measure true performance in a way that minimizes random variation as well as unde- 
sired and unintended consequences. It must align performance with ultimate outcomes and 
must be monitored closely to discourage gaming if not outright fraud in measured output. Goals 
should be attainable. Incentives should balance individual rewards with school incentives,
fostering a cooperative culture without encouraging free riding. All teachers should be eligible
for the incentive offered, but only a subset of teachers should be rewarded in practice. If too 
many teachers are rewarded, teachers may not need to exert much extra effort to benefit. 

Many of the practical challenges faced by performance-related pay, Lavy says, can be ad- 
dressed through careful design of the system. He emphasizes that setting up a performance-re- 
lated pay system that works is not a one-time task. Even with the best preparation, initial imple- 
mentation is likely to be problematic. But if the effort is seen as ongoing, it should be possible 
to make progress gradually in developing incentives that motivate the desired teaching behav- 
iors and that will be perceived by teachers as fair and accurate. 
Interest in improving public education
is growing not only in the United 
States but worldwide. One reason for 
the heightened public attention is the 
key role played by education in deter- 
mining both individual earnings and broader 
economic growth. Another is widespread dis- 
satisfaction with the education sector’s per- 
formance of late: substantial increases in 
spending on public schools have failed to 
bring corresponding increases in student 
achievement. 1 

The quest to improve public education has 
led policymakers and researchers to focus on 
how to increase teachers’ effectiveness. One 
obvious means is compensation. According to 
many observers, the traditional basis for 
teacher pay — years of service and educa- 
tion — provides little incentive for excellence. 
To make teachers more effective, these crit- 
ics argue, pay should be tied to performance. 
And some school districts, here and abroad, 
are undertaking reforms to test those ideas. 
In November 2005, for example, Denver vot- 
ers approved a $25 million tax increase to 
fund a form of “merit pay” to reward elemen- 
tary and secondary school teachers along a 
variety of dimensions, including their own 
demonstrated knowledge and skills and stu- 
dent academic growth. Whether Denver’s 
new merit pay system will improve student 
achievement remains uncertain; an earlier 
pilot study in Denver found mixed results. 2 

In this article I examine academic and policy 
analysis of performance-based reward pro- 
grams for primary and secondary school 
teachers. I stress, in particular, several ques- 
tions. What are the pros and cons of imple- 
menting teachers’ pay incentives in schools? 
What criteria are to be applied in designing 
optimal teacher incentives? How much is 
performance affected by incentives offered
to practicing teachers? How will incentives
affect the composition of applicants to 
teacher-training institutions and to teaching 
positions in the schools? What policy meas- 
ures can remedy existing distortions in teach- 
ers’ compensation? My intent is not to review 
exhaustively what is known about perform- 
ance-based pay in education, but rather to 
summarize selected key findings, highlight 
some guidelines for designing effective 
teacher incentives schemes, and identify 
areas requiring additional evidence. My ob- 
jectives are to present the theoretical bene- 
fits of performance-based pay as well as some 
of the practical obstacles to its effective im- 
plementation, to review critically the empiri- 
cal evidence, and to draw policy conclusions. 

Teachers’ Compensation 

Pay for performance is meant to solve the 
twofold problem of motivating high teacher 
performance while attracting and retaining 
good teachers under conditions where their 
effort or ability is not readily measured or ob- 
served. In the teaching profession, earnings 
are based primarily on input (that is, skills 
and time worked), rather than on output. 
Such a basis, critics say, is not “results- 
oriented.” Moving to an earnings structure 
that ties pay — at least partially — to some per- 
formance indicators should thus improve the 
current system. In theory, the idea makes 
good sense. But implementing pay for per- 
formance poses many practical challenges. In 
the teaching profession effort and output are 
difficult to define and measure because the 
work is generally complex, unique, and often 
results from team efforts, with any one 
teacher’s effort difficult to disentangle from 
that of the others on the team. One key goal 
of education is to give students the skills 
needed to ensure a productive career and 
sustain their economic well-being. Yet, be- 
cause it takes years for the adult earnings of a 
student to materialize, it is impossible to tie a
teacher’s wages to his students’ earnings. Re- 
cent studies do in fact suggest that students’ 
test scores are strongly correlated with their
future earnings, but using test scores to
measure a teacher’s performance presents
practical problems. 3 Peer or principal evaluations
are yet another way to measure performance,
given the drawbacks of testing and
the many teachers for whom testing would 
not apply. But these approaches also have 
drawbacks. 

Performance-Based Pay 

Pay based on performance usually involves 
some objective assessment of schools’ or 
teachers’ efforts or success or some measure 
of their students’ performance. Performance- 
based pay schemes have many variable fea- 
tures. They can compensate teachers only for 
their own performance or they can be struc- 
tured as a team incentive program, with 
group performance determining the total in- 
centive payment, which is then divided 
among team members regardless of individ- 
ual performance. The group can include all 
of the school’s teachers or a subgroup, such 
as the teachers of a given grade or a specific 
subject. Performance-based pay schemes 
can, but need not, involve sanctions for 
below-threshold performance. Although 
monetary rewards are the most common in- 
centive in performance-related pay, other in- 
centives can include reduced teaching load, 
promotion, and public recognition of out- 
standing teachers. The reward can be just a 
one-time event or it can be ongoing, leading 
to a permanent salary increase. It can be 
based on a relative criterion (for example, the 
average test score gain of a teacher’s class rel- 
ative to the classes of other teachers) or on an 
absolute criterion (such as the class average 
test score being higher than a predetermined 
threshold). The reward may be a fixed sum
that is equal for all winners, or it can vary and
increase with the winner’s level of achieve- 
ment. The total amount of awards may be 
predetermined (for example, only a certain 
number of teachers can win an award) or it 
may be open. The performance criteria can 
include outcomes for the teachers themselves,
such as measures of absenteeism or
performance on a test. They can also include
measures of the teacher’s students’ perform- 
ance, such as attendance, grade retention, 
dropout rates, or performance on tests. 
These criteria are not mutually exclusive. 

The target set for determining award winners 
is critically important both for efficiency and 
for equity. For example, if schools are ranked 
according to how many students attain a cer- 
tain level of literacy as determined by an ex- 
amination, gains near the cutoff are most re- 
warded. But if schools or teachers are 
rewarded on the basis of average test scores 
or changes in those averages, then credit is 
given for gains at all parts of the achievement 
distribution, not just those close to the cutoff 
point. For example, students in the bottom 5 
percent of the achievement distribution 
might be too far away from the literacy stan- 
dard to pass the test after one or two years, 
but raising their test scores might be worth- 
while nonetheless. A hybrid measure could 
target both a cutoff and an average.
It is also possible to design a scheme that
differentially weights improved test scores at 
different points of the achievement distribu- 
tion, with the bottom weighted more than 
the top, the middle more than either ex- 
treme, and so forth. Finally, an important, 
but potentially underappreciated, part of a 
pay-for-performance plan is the identity of 
those who evaluate the teachers. The evalua- 
tors can be external to the school or can be 
peers, principals, or district supervisors. 
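The contrast drawn above between cutoff-based, average-based, and differentially weighted targets can be illustrated with a minimal sketch. Everything in it is hypothetical: the scores, the cutoff of 60, the weights, and the function names are invented for illustration and are not taken from any program reviewed in this article.

```python
# Hypothetical illustration: scores, cutoff, and weights are invented.

def pass_rate(scores, cutoff):
    """Share of students at or above the cutoff (threshold-style target)."""
    return sum(s >= cutoff for s in scores) / len(scores)


def mean_score(scores):
    """Class average (average-style target)."""
    return sum(scores) / len(scores)


def weighted_gain(before, after, cutoff, low_weight=2.0, high_weight=1.0):
    """Weighted average gain; students who started below the cutoff count more."""
    total = 0.0
    weight_sum = 0.0
    for b, a in zip(before, after):
        w = low_weight if b < cutoff else high_weight
        total += w * (a - b)
        weight_sum += w
    return total / weight_sum


CUTOFF = 60
before = [20, 35, 58, 59, 75, 90]
# Scenario A: only the two students just below the cutoff improve.
near_cutoff_gains = [20, 35, 61, 62, 75, 90]
# Scenario B: only the two lowest-scoring students improve.
bottom_gains = [30, 45, 58, 59, 75, 90]

for label, after in [("near cutoff", near_cutoff_gains), ("bottom", bottom_gains)]:
    print(
        label,
        "pass rate change:", pass_rate(after, CUTOFF) - pass_rate(before, CUTOFF),
        "mean change:", mean_score(after) - mean_score(before),
        "weighted gain:", weighted_gain(before, after, CUTOFF),
    )
```

In this invented example the pass rate rises only when the students near the cutoff improve, whereas the class average and the weighted gain also credit the larger gains made by the lowest-scoring students.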

Despite the almost innumerable combina- 
tions offered by these options, three proto- 
types of performance-based reward programs 
are most often implemented in education sys- 
tems and are commonly examined by re- 
searchers. The first model, merit pay, gener- 
ally involves individual incentives based on 
student performance. The second, knowl- 
edge- and skill-based compensation, generally 
involves individual incentives based on 
teacher skills. Knowledge- and skill-based pay 
differs from merit pay because it provides 
clear guidelines on what is being evaluated. 
The third model, school-based compensation, 
generally involves schoolwide incentives, typi- 
cally based on student performance. 

Potential Benefits of 
Performance-Based Pay 
Performance-based pay in education brings 
with it many potential benefits but also many 
challenges. This section and the next present 
the main issues. 

Productivity and Efficiency Considerations. 
In theory, pay based on output has two ad- 
vantages over input-based pay in terms of ef- 
ficiency (that is, producing “more” education 
for the same cost). The first, most frequently 
noted efficiency advantage has to do with incentives.
Rewarding teachers or schools on
the basis of an agreed metric aligns incentives
directed at teachers or schools with
those directed at students and potentially the 
entire society. If wages are based on student 
performance, for example, they provide 
teachers or schools with powerful signals 
about what is valued and what is not. Absent 
such signals, even well-meaning teachers may 
emphasize material that is obsolete or gener- 
ally no longer valued by parents or the labor 
market. Similarly, if wages are based not only 
on the individual benefits of schooling to stu- 
dents (social scientists call these “private re- 
turns”) but on the benefits to society as a 
whole (“social returns”), teachers or schools 
would take into account the social returns to 
education when making choices about their 
work. A student, for example, may want to 
drop out before completing high school be- 
cause he feels that the costs of staying in 
school outweigh his individual benefits. A 
teacher considering only those individual 
benefits may not work as hard to discourage 
him from dropping out as would a teacher 
considering the costs and benefits to society 
as a whole. 

Individual performance-based pay schemes 
improve efficiency by helping correct distortions
in a teacher’s effort that might result
from gaps between her preferences and 
those of her students. For example, a teacher 
might fail to assign homework even though 
she knows its value for her students because 
correcting and grading assignments involves 
more work for herself. Individual perform- 
ance-based pay provides some incentive for 
the teacher to do the “right thing.” 

The second efficiency advantage of output- 
based pay, mainly relevant for the merit pay 
model, involves sorting and selection. Assum- 
ing that the compensation system accurately 
identifies productivity, basing pay on performance
will attract and retain the most productive
teachers. Even if teachers are unable
to alter their own behavior to enhance per- 
formance, as measured, say, by students’ test 
scores, some people are still inherently better 
than others at affecting test scores. Basing 
pay on output also tends to discourage teach- 
ers who cannot enhance their students’ per- 
formance from remaining in the profession. 
A related point is that output-based pay will 
create a market for teaching quality that will 
help teachers move to schools where their 
talent is most highly valued. Equalization be- 
tween productivity and wages will result, with 
poorly performing teachers receiving re- 
duced wages and lower probabilities of pro- 
motion, and more capable teachers com- 
manding better options. Finally, if teachers 
are able to improve their classroom perform- 
ance, linking compensation to performance 
will provide all teachers incentives to im- 
prove through professional development, 
which will therefore induce still further
productivity gains. 4

Performance-related pay based on individual
or schoolwide schemes could also improve
school productivity by inducing better governance.
For one thing, it requires school principals
to monitor closely the quality of their
teachers’ work. 5 It is also assumed to bring
about more coherent and common teacher-management
goals in addition to an improved
flow of information and feedback
among all school agents. This result is assumed
to flow from a common interest in improved
outcomes. 6

Other Potential Benefits. Critics of traditional 
pay schemes that reward experience and for- 
mal qualifications instead of performance 
argue that these schemes are unfair to highly 
motivated, effective, and efficient teachers 
whose extra efforts are not rewarded. 7 Performance-based
pay can thus make compensation
systems more equitable.

Finally, performance-based pay may increase 
support for public education from politicians 
and members of the public who are con- 
vinced that the reform will reverse the edu- 
cation sector’s poor reputation and perceived 
inefficient use of resources. 8

Potential Drawbacks to 
Performance-Based Pay 
Despite its theoretical benefits, performance-based
pay poses many practical challenges.

Measurement Problems. Performance meas- 
urement poses two separate problems for per- 
formance-based pay. Incentive systems as- 
sume that everyone can agree on goals; they 
also assume that it is possible to measure ac- 
curately progress toward these goals. Agree- 
ing on goals is particularly difficult in educa- 
tion because competition between public 
schools is rare. In the private sector, market 
mechanisms discipline firms into providing 
products that consumers value, but public 
schools lack market discipline. Schooling is 
compulsory and public, and students are sim- 
ply assigned to attend their neighborhood 
school. Parents and students who are unhappy 
with what their schools offer generally have no 
alternative except to attend a private school or 
move to another neighborhood or city — alter- 
natives that are too costly for many . 9 

The other measurement issue represents the 
most common claim made against perform- 
ance-based pay: evaluating progress toward 
the goal fairly and accurately is problematic. 
This is especially so when evaluation is based 
on proxies (as it often is), such as self-reported
effort and motivation. 10 Identifying
precisely what one teacher contributes to a 
student’s performance and separating his 
contribution from those of other teachers,
the school, the principal, and the family is ex- 
tremely difficult. 11 Compounding the prob- 
lem is the fact that students are often deliber- 
ately assigned to specific teachers — that is, 
the assignment of teachers to classes is not 
random. A still further complication is how to 
identify the contributions of previous teach- 
ers, who may have been superior or inferior. 


Negative Effects on Motivation and Collegiality.
Another concern is that implementing individual-based
incentives may create unfair
competition between teachers, especially in 
the absence of transparent criteria, thus un- 
dermining collaboration. Even if evaluation is 
accurate and fair, teachers may still feel ag- 
grieved if their competence is questioned. 
Evaluation may also create new hierarchies 
by giving administrators an additional source 
of power over teachers and the curriculum. 
Individual incentives could also undermine 
principal-teacher relationships because of the 
asymmetry in how each party views teacher 
evaluation: teachers use it to determine how 
they are performing and how they can im- 
prove, while principals use it to measure 
teachers’ contribution to the school. 12 

Unintended Consequences. Some analysts 
caution that performance-based pay may
have unintended consequences. Teachers, for
example, may focus on the easiest way to in- 
crease the rewarded measure while ignoring 
measures that schools and parents ultimately 
want to improve. 13 Similarly, when one di- 
mension of output is easily measured but an- 
other is not, teachers may dedicate their ef- 
forts to maximizing the measurable at the 
expense of the unmeasured dimension. Col- 
lectively, such efforts could even begin to 
constrict a school’s curriculum to measurable 
subjects. 14 A further risk is that because test 
scores measure only certain skills, linking 
compensation to test scores might cause 
teachers to sacrifice the nurturing of curiosity 
and creative thinking to teaching the skills 
tested on standardized exams — a practice 
known as teaching to the test. 15 Pay based on
reading and math test scores, for example, 
might encourage teachers to favor those sub- 
jects at the expense of, say, music and art or 
values and civic responsibility. 16 A teaching- 
to-the-test mentality is thus assumed to sup- 
port the creation of a system where a narrow 
curriculum necessarily restricts student 
achievement in domains not tested. 

Unintended consequences may also arise if 
teachers “game play” and develop responses 
that generate rewards contradicting the profession’s
spirit. 17 In other words, measuring
student output may stimulate teachers to par- 
ticipate in inappropriate or deviant behavior, 
such as cheating. Using data from Chicago’s 
public schools, Brian Jacob and Steve Levitt 
detected cheating in approximately 4 to 5 
percent of the classes in their sample. 18 They 
also found that cheating responds swiftly to 
changes in teacher incentives. After stan- 
dardized tests took on increased salience in 
Chicago’s public schools in 1996, the preva- 
lence of cheating rose sharply in low- 
achieving classrooms, but not in classes with 
average or above-average students. The 
prevalence of cheating also appeared to be
systematically lower where the costs of cheat- 
ing were higher or the benefits of cheating 
lower, as in classrooms where a large number 
of students’ test scores were excluded from 
official calculations because they were bilingual.
Other documented unintended consequences
include altering school lunch menus
during testing periods in an apparent attempt
to artificially increase student test scores and
manipulating who takes the test. 19

Providing financial incentives to improve 
performance may be counterproductive in 
other ways as well. First, it may demoralize 
teachers, resulting in reduced effort. In labo- 
ratory experiments, one study found that 
workers in high-powered incentive systems 
may become unmotivated and thus work less 
than they would under a flat wage regime. 20
Second, financial incentives may undermine
intrinsic motivation, that is, the sense of duty
or satisfaction that motivates coming to
work. 21 This threat is particularly real for
teachers, who, as a group, exhibit strong intrinsic
motivation flowing from the value they
place on interacting with children and seeing
them succeed. 22

Another potential distortion is that teachers 
may focus disproportionate attention on 
those students who are most likely to improve
their test scores or to cross a designated
threshold. 23 The highest- and lowest-performing
students may consequently be
neglected because they do not promise ade- 
quate returns on investments of teachers’ 
quality time. 

Risks Posed to Teachers Could Increase 
Costs. The risks posed to teachers by performance-based
pay could lead them to demand
high compensation, which could in
turn raise the cost of education. 24 Unlike relatively
risk-free input-based payment, performance-based
pay exposes employees to
earnings variability beyond their control. If 
teachers, like other workers, are risk averse, 
inducing them to accept a risky compensa- 
tion package will entail higher average pay 
overall. 

Teachers Are Motivated by Nonfinancial Incentives.
A frequent criticism of performance-based
pay is that teachers, as professionals
relatively immune to motivation by pecuniary 
rewards, will not respond to financial incen- 
tives. Monetary rewards could thus simply in- 
flame resentment toward management and 
decrease employee loyalty, both of which 
could reduce productivity. One study suggests 
that nonmonetary rewards, such as additional 
holidays, may be better motivators. 25

Union and Teachers’ Opposition. Teacher
unions worldwide strongly oppose performance-based
pay. 26 Unions view wage differentiation
on the basis of subject taught, as well
as any sort of subjective evaluation of teachers,
as threats to their collective bargaining
strategies and therefore reject them outright.
And union views weigh heavily: lobbying by
unions has often halted efforts to legislate
performance-based rewards. 27 Union objections
appear to reflect opposition voiced by
teachers directly. 28 Teachers see performance-based
pay, supported by unfair evaluation,
as a threat to their autonomy. Sanctions
against poorly performing schools, which are
included in some performance-based
schemes, are another major source of union
and teacher opposition. 29

Disappointing Experience with Past Merit
Pay Programs. The repeated failures of
poorly designed and implemented merit pay 
programs over the past two decades have undermined
the credibility of new and better-designed
initiatives. A key weakness in past
programs has been opaque goals, which make 
it hard for teachers to understand the pro- 
gram and undermine their support for it. 
Opaque goals also make it difficult for admin- 
istrators to explain why some staff members 
receive a bonus and others do not. One study 
finds that even in established programs such 
as those implemented in Kentucky and North 
Carolina, many participants remain skeptical 
that bonuses go to qualified teachers. 30

The High Cost of Performance-Based Pay 
Schemes. Finally, implementing perform- 
ance-based pay is easier in small organiza- 
tions, such as private schools, than in large 
public school systems with sizable teaching 
staffs. System size thus adds to the
observed high cost of performance-related
pay, which can make such programs infeasible. One
study argues that adequate evaluation of
every teacher, expensive in itself, would require
considerable resources if performed
regularly. 31 The time alone required to administer
a pay-for-performance system would
have severe budgetary implications. 32 Moreover,
as one study points out, improved
productivity in the private sector can generate
added income to help mitigate budget
problems, but enhanced productivity in public
schools has no such effect. 33

Overcoming Some of the Obstacles 
Several of the many potential obstacles to im- 
plementing an effective performance-related 
pay system can be addressed. For example, 
one solution to the measurement problem is 
to compensate teachers on the basis of princi- 
pal evaluations. Brian Jacob and Lars Lefgren 
compared subjective principal assessments 
with measures of teacher effectiveness based 
on gains in student achievement on standardized
tests (measures often known as teacher
value added) and found that principals are
quite good at identifying those teachers who
produce the largest and the smallest standardized
achievement gains in their schools, but
far less able to distinguish between teachers
in the middle of the distribution. 34 They also
found that principals systematically discriminate
against male and untenured faculty. 35

A principal-based assessment system would 
likely result in higher student achievement 
than today’s input-based compensation system.
But Jacob and Lefgren cite important
limitations of their research. First, the
principals whom they examined were not 
themselves evaluated explicitly on the basis 
of their ability to identify effective teachers. 
Moving to a system where principals have 
more authority and responsibility for moni- 
toring teacher effectiveness might enhance a 
principal’s capacity to identify the required 
characteristics. But principals may be less 
willing to assess teachers honestly under such 
a system, perhaps in response to social or po- 
litical pressures. Further, the inability of 
principals to distinguish between teachers in 
a broad middle range of quality suggests cau- 
tion in relying on principals for the finely 
tuned performance determinations that 
might be required under certain merit pay 
policies. 

In response to the concern that merit pay 
models may hamper collaboration, one could 
structure the system to reward teacher cooperation,
especially through group-based
pay. 36 This strategy can foster both teacher
interdependence and acknowledgement of
that interdependence. That said, team-based
incentive systems raise the risk of “free riding.” 37
If each teacher’s share of the team reward
is small relative to the cost of effort and
if effort is difficult to observe, every teacher
in the team will have an incentive to shirk
and free ride on the efforts of others. 38 One
way to avoid this problem is to encourage
peer pressure and mutual monitoring within
the team. 39
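The free-riding condition described above can be made concrete with a minimal sketch. The setup is an assumption of mine for illustration, not drawn from the studies cited: a team of N teachers splits any bonus equally, one teacher's extra effort raises the total team bonus by ΔB, and that effort costs her c. All three symbols and the numbers below are hypothetical.

```latex
% Hypothetical setup: equal split of the team bonus among N teachers.
\[
\text{the teacher exerts effort} \iff \frac{\Delta B}{N} \ge c
\]
% Illustrative numbers (invented): \Delta B = 1000 dollars, N = 40 teachers.
\[
\frac{\Delta B}{N} = \frac{1000}{40} = 25
\]
```

With these invented numbers the teacher receives only $25 of the $1,000 that her effort adds to the team bonus, so she has an incentive to shirk whenever the effort costs her more than $25. An individual reward of the same size would pay her the full amount.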

Fears that teachers would need to be highly 
compensated for the increased risk in a per- 
formance-based compensation system are 
probably overstated. If teachers are paid on 
the basis of student performance, and if the 
number of students whom teachers teach 
each year is high, the year-to-year variation in 
average class test scores is likely to be small. 
Furthermore, even under the most ambitious 
schemes only a fraction of teachers’ wages 
would be tied to performance, thus making 
compensation based on incentives only a 
marginal component of pay. It is thus unlikely 
that earnings will fluctuate by more than a 
few percent annually around some basic 
trend. 
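The claim about year-to-year variation can be illustrated with a standard sampling result, sketched here under an assumption of mine: each student's score deviates from its expected value by independent noise with standard deviation σ. The value σ = 10 points and the class sizes below are invented for illustration.

```latex
% Assumption: independent student-level noise with standard deviation \sigma.
\[
\mathrm{sd}\left(\bar{X}_n\right) = \frac{\sigma}{\sqrt{n}}
\]
% Illustrative values (invented): \sigma = 10 points.
\[
n = 25:\ \frac{10}{\sqrt{25}} = 2,
\qquad
n = 100:\ \frac{10}{\sqrt{100}} = 1
\]
```

Under this assumption, quadrupling the number of students halves the noise in the class average. If scores within a class share common shocks, the independence assumption would understate the variation that remains.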

The idea that teachers themselves — as re- 
flected in the positions of their unions — op- 
pose performance-based rewards may also be 
overstated. One study found that most teach- 
ers favor additional pay for additional duties 
per se and as a component of a career ladder 
where performance dictates the speed of advancement. 40
Unsurprisingly, performance-based
rewards are more popular when they
supplement, rather than replace, other forms 
of salary. 

The same study found that the pay level in a 
school district appeared to have no influence 
on teachers’ attitudes toward merit pay, al- 
though teachers who were paid low salaries 
and who belonged to ethnic minorities were 
more likely than others to support the pro- 
gram. Attitudes toward merit pay were inde- 
pendent of the number of students who were 
eligible for free lunches, suggesting that stu- 
dents’ socioeconomic status did not affect 
teachers’ views on merit pay. Interestingly,
private school teachers viewed performance-related
pay more favorably than their public
school counterparts. 41 Teachers’ attitudes
thus appear relatively malleable and to depend
on program design.

Teachers’ and union objections were over- 
come in some cases when specific interest 
groups and legislators supported performance-based
pay. But political turnover makes
such support fragile, particularly in times of 
economic recession, because the cost of 
performance-based pay is more visible than 
are the benefits of improved student 
achievement. 42 

Many of the practical challenges faced by 
performance-related pay, then, can be ad- 
dressed through careful design of the system. 
And despite the opposition of teachers’
unions to performance-based compensation,
it is not clear that the objections to such sys- 
tems come from the teachers themselves. 

Evidence on School-Based 
Performance Systems 

In this section I review evidence on several 
school-based incentive programs imple- 
mented in recent years both in the United 
States and in other countries. The programs 
vary in their basic structure and details, with 
some targeted at teams of teachers and others
at individuals. 43

Evidence from the United States 
Although school-based performance pay the- 
oretically has many attractive features, re- 
searchers have been able to find little causal 
evidence that it is effective in U.S. programs. 
For example, three researchers studied 
school-based incentive pay systems in Ken- 
tucky, North Carolina (Charlotte-Mecklen- 
burg), and Maryland. 44 They concluded that 
in the Charlotte-Mecklenburg and Kentucky 
programs, but not in the Maryland program, 
both teacher motivation and student out- 
comes improved. But because all three stud- 
ies lacked a control group, they could not es- 
tablish definitively that the program 
itself — and not some other factor — was the 
cause of the improvements. 45 

Similarly, Helen Ladd studied a school-based
bonus program in Dallas. 46 The program,
which began in the 1991-92 school year and 
continued through 1995, ranked schools by 
how well their students’ test scores compared 
with state average scores, adjusting for stu- 
dents’ socioeconomic status. To avoid teach- 
ing to the test or other gaming behavior, the 
program relied on multiple measures of stu- 
dent outcomes, including two tests given 
each year. Ladd compared gains in school- 
level test scores in Dallas with gains in other 
cities (adjusting for many school characteris- 
tics, such as racial mix and relative depriva- 
tion) to evaluate the impact of this bonus 
scheme. She found that pass rates appeared 
to increase more quickly in Dallas than in 
other cities. Effects were most positive for 
Hispanics and whites and insignificant for 
blacks. Although the study suggests that a 
school-based program can be effective, it was 
not conclusive. It had, for example, only a 
limited number of student and school charac-
teristics to adjust to make the participating
schools comparable to other schools in the 
state. In addition, the test score gains in Dal- 
las may have been part of a trend that started 
before the program was implemented. 

The Dallas study also highlights some unin- 
tended consequences. In an earlier study, 
Charles Clotfelter and Ladd had reported 
that in the Dallas program, schools of low so-
cioeconomic status rarely won awards. 47 In
response, the state divided schools into five 
groups based on socioeconomic characteris- 
tics and rewarded the top performers in each 
group. But some of the lower-performing 
schools in the upper socioeconomic bands 
felt that they had been treated unfairly. Di- 
viding the schools into socioeconomic groups 
also encouraged an undesired strategic re- 
sponse from principals who realized that 
their ability to gain an award was based on 
the socioeconomic category into which they 
were placed. 

Finally, two studies of a South Carolina per- 
formance-based program that included both 
school-based and individual-based rewards 
found that student performance improved. 48 
The studies, however, may overstate the in- 
centive effects because teachers could 
choose whether to apply for an award. If, as 
would be expected, only the most productive 
teachers chose to apply, then part of the stu- 
dent gains may be attributable not to the in- 
centives but to the fact that participants were 
better teachers in the first place. 

International Evidence 

One of the stronger examples of a school- 
based incentive program comes from Israel. 
In February 1995, Israel announced a com- 
petition for a monetary bonus for secondary 
schools and teachers based on their students’ 
performance. 49 The objectives were to re-
duce dropout rates and improve scholastic
achievement. The three performance meas- 
ures were average number of credits per stu- 
dent, share of students receiving a matricula- 
tion diploma, and school dropout rate. 

Sixty-two schools were initially selected for 
the program, with several schools added 
later. In 1996, participating schools com- 
peted for about $1.5 million in awards. 
Schools were ranked according to their an- 
nual improvement, adjusting for the socio- 
economic background of the students. Only 
the top third of performers won awards. The 
distribution of cash incentives among the 
award-winning schools was determined solely 
by their ranking in terms of relative improve- 
ment (in 1996, the highest-scoring winner 
won $105,000; the lowest-scoring, $13,250). 
Teachers received 75 percent of the award as 
a salary bonus (proportional to gross income); 
the remainder was used to improve faculty 
facilities, such as teachers’ common rooms. 
In 1996, the bonuses ranged from 1 to 3 per- 
cent of average teacher salary. The combined 
performance of a team determined the total 
incentive payment, which was split among in- 
dividuals regardless of performance. 50 
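
The award-splitting rule just described can be sketched in a few lines of Python. This is only an illustration: the function name, the teacher labels, and the salary figures are hypothetical, and only the 75 percent teacher share, the proportionality to gross income, and the $105,000 top award come from the text.

```python
def split_school_award(award, gross_incomes, teacher_share=0.75):
    """Split a school's award into teacher bonuses and a facilities fund.

    gross_incomes maps each teacher to gross income; bonuses are
    proportional to income, not to individual performance.
    """
    if not gross_incomes:
        raise ValueError("need at least one teacher")
    bonus_pool = award * teacher_share
    total_income = sum(gross_incomes.values())
    bonuses = {
        name: bonus_pool * income / total_income
        for name, income in gross_incomes.items()
    }
    facilities_fund = award - bonus_pool
    return bonuses, facilities_fund


# Hypothetical three-teacher school; real schools had far more staff,
# which is why actual bonuses were only 1 to 3 percent of salary.
bonuses, facilities_fund = split_school_award(
    105_000, {"A": 30_000, "B": 20_000, "C": 10_000}
)
```

Because the split depends only on income, a teacher's bonus is the same whatever her own classes contributed, which is the free-riding risk of a purely team-based reward.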

The student outcomes rewarded included 
most of those that can be affected by teach- 
ers, thereby reducing the dilemma teachers 
are assumed to face regarding how to allocate 
their time between rewarded and nonre- 
warded activities. School averages of all three 
performance measures were based on the size 
of the graduating cohort while in ninth grade 
rather than in twelfth grade. This procedure 
was adopted to discourage schools from gam- 
ing the incentive system — by encouraging 
weak students to transfer or drop out or by 
placing them in the nonmatriculation track. 
To encourage schools to direct more effort to-
ward weak students, only the first 22 credit
units taken by each student were counted in
computing the school’s mean to determine its 
rank in the bonus program. 
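
A minimal sketch of the two safeguards described above, the ninth-grade denominator and the 22-credit cap, might look as follows. The function name and the example cohort are assumptions made for illustration; the program's actual computation is not given in the text.

```python
def school_mean_credits(credits_by_student, ninth_grade_cohort_size, cap=22):
    """Mean credit units per student, as counted for the school's ranking.

    Credits above the cap are ignored, and the denominator is the size
    of the cohort in ninth grade, so students who later transferred or
    dropped out still count against the school.
    """
    if ninth_grade_cohort_size <= 0:
        raise ValueError("cohort size must be positive")
    capped_total = sum(min(credits, cap) for credits in credits_by_student)
    return capped_total / ninth_grade_cohort_size


# Hypothetical cohort: five students in ninth grade, four still enrolled.
remaining_credits = [30, 22, 18, 10]
program_mean = school_mean_credits(remaining_credits, ninth_grade_cohort_size=5)
naive_mean = sum(remaining_credits) / len(remaining_credits)
```

In this example the program's measure is 14.4 credits, against a naive twelfth-grade average of 20: losing a student lowers the measure, and extra credits for the strongest student do not raise it.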

Two years after the program was imple- 
mented, I compared the program schools 
with a control group and found significant 
gains in student performance in the former. 51 
Average credits were 0.7 unit higher, the 
share of students sitting for matriculation ex- 
aminations increased by 2.1 percent, and av- 
erage scores and passing rates in these exam- 
inations improved as well. Of particular 
importance was the decline in the dropout 
rate in students’ transition from middle to 
high school. The programs also appeared 
mainly to affect weaker students. 

Another analysis of a school-based teachers’ 
incentive program, this one in Kenya, exam- 
ined effects on both teacher behavior and test 
scores. 52 The program randomly assigned fifty 
Kenyan primary schools to a treatment group 
eligible for monetary incentives (21-43 per-
cent of monthly salary). The winning schools
were determined by their average test score 
performance relative to other treatment 
schools in districtwide examinations; all teach- 
ers in the winning schools received awards. 
The program penalized schools for dropouts 
by assigning low scores to students who did 
not take the examination. Data were collected 
on many types of teacher effort — teacher at- 
tendance, homework assignments, pedagogi- 
cal techniques, and holding extra test prepara- 
tion sessions — and on student scores obtained 
after the program’s conclusion. 

During the two years the program was in 
place, student scores increased significantly 
in treatment schools (0.14 standard deviation 
above the control group). But the gain in 
scores was not attributable to the expected 
incentive-induced changes in teacher behav-
ior. In fact, teacher attendance did not im-
prove, and no changes were found in either
homework assignment or pedagogy. Instead, 
teachers were more likely to conduct test 
preparation sessions outside regular class 
hours. Data collected the year after the pro- 
gram ended showed no lasting test score 
gains, suggesting that the teachers focused on 
improving short-term rather than long-term 
learning. Consistent with this hypothesis, the 
program had no effect on dropout rates even 
though examination participation rose (pre- 
sumably because teachers wanted to avoid 
penalties for no-shows). The test score effect 
was also strongest in geography, history, and 
Christian religion, arguably subjects involv- 
ing the most memorization. 

Summary 

Although group-based pay, either alone or 
combined with individual-based incentives, 
has the promise of overcoming some of the 
difficulties inherent in implementing individ- 
ual-based systems, the little causal evidence 
of its effectiveness is mixed. The strongest 
evidence comes from the Israeli experience; 
whether it could be replicated either in the 
United States or abroad is unknown. 

Evidence on Individual-Based 
Performance Systems 

In this section I review evidence on several 
individual-based incentive programs, again 
both in the United States and abroad. 

Evidence from the United States 

Studies of individual-based incentive schemes 
in the United States have had some success in 
isolating the programs’ causal effects on stu- 
dent outcomes. But their findings have also 
been quite mixed. For example, one study as- 
sessed the effect on student achievement of a 
merit pay scheme in Michigan that rewarded 
individual teachers according to student re-
tention rates and evaluation questionnaires
completed by their students. 53 The scheme, 
which did not directly target student achieve- 
ment, did improve student retention. But 
pass rates fell, while attendance rates and 
grade point averages remained unchanged. 
The authors concluded that “incentive sys- 
tems within complex organizations such as 
schools may produce results that are unin- 
tended and at times misdirected.” 

In contrast, another study combined panel 
data from the U.S. National Education Lon- 
gitudinal Survey of 1988 (NELS88) to esti- 
mate the effects of teacher incentives on 
student outcomes. 54 The authors defined in- 
centive schemes as any merit raise or bonus 
awarded to any proportion of teachers in a 
school, although the variables did not identify 
whether schemes stipulated that rewards 
were to be tied directly to student achieve- 
ment. The wealth of data in the survey 
enabled the authors to control for many stu- 
dent, teacher, school, and family characteris- 
tics to make it easier to compare students 
taught by treatment teachers (those in the in- 
centive scheme) with students taught by 
teachers in a control group. The results were 
positive, particularly in public and poor (low 
economic status) schools. Test scores were 
higher when awards were higher and when 
awards were given only to a few teachers 
within a school. 

Finally, a third study analyzed incentive ef- 
fects on student SAT scores in the Tennessee 
STAR (Student Teacher Achievement Ratio)
and Career Ladder Evaluation programs. 55 It 
controlled for student and teacher character- 
istics as well as for class attributes that do not 
change over time (by including class fixed ef- 
fects based on panel data). It found that SAT 
scores improved, with gains varying across 
subjects and with teacher seniority.

One reason for the mixed evidence may be
that the studies combine students at all grade 
levels. One analysis of merit pay reforms in 
South Carolina in the 1980s and 1990s sug- 
gested that merit pay might be more effective 
in earlier grades than in later grades. More 
generally, the study cautioned that the effects 
of performance-based pay may vary across 
countries, schools, population groups, or time. 

International Evidence 

The first international example of an individ- 
ual-based program is an experiment, begun in 
fifty high schools in Israel in December 2000, 
that offered teachers a bonus based on stu- 
dent achievement. The experiment included 
all English, Hebrew, Arabic, and mathematics 
teachers who taught tenth- to twelfth-grade 
classes in preparation for matriculation exam- 
inations in these subjects in June 2001. Each 
teacher was ranked separately on the basis of 
the mean performance of each class she 
taught. The ranking was based on the differ- 
ence between actual class performance and 
performance predicted on the basis of stu- 
dents’ socioeconomic characteristics, their 
level of proficiency in each subject, and a 
fixed school-level effect. Each teacher was 
ranked twice, once for the students’ passing 
rate and once for average score. 

Each school submitted student enrollment 
lists, itemized by grade, subject, and teacher, 
on the program’s starting date. All students 
on these lists were included in the class mean 
outcomes. Students who dropped out or did 
not take the exams, regardless of the reason, 
were imputed a score of zero to neutralize 
any incentive for teachers to keep poorly per- 
forming students out of the tests. 
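
The zero-imputation rule and the ranking residual can be illustrated with a short Python sketch. The names, the pass mark of 55, and the predicted values are hypothetical; in the experiment the predicted performance came from a statistical model that is taken here as a given input.

```python
def class_outcomes(enrolled_ids, exam_scores, pass_mark=55):
    """Class mean score and passing rate over the starting-date list.

    Students on the enrollment list with no exam score (dropouts or
    no-shows) are imputed a score of zero.
    """
    if not enrolled_ids:
        raise ValueError("enrollment list is empty")
    imputed = [exam_scores.get(student, 0.0) for student in enrolled_ids]
    mean_score = sum(imputed) / len(imputed)
    pass_rate = sum(1 for score in imputed if score >= pass_mark) / len(imputed)
    return mean_score, pass_rate


def ranking_residual(actual, predicted):
    """Actual class performance minus predicted performance."""
    return actual - predicted


# Hypothetical class of four; student "s4" did not sit the exam.
enrolled = ["s1", "s2", "s3", "s4"]
scores = {"s1": 80.0, "s2": 60.0, "s3": 40.0}
mean_score, pass_rate = class_outcomes(enrolled, scores)
score_residual = ranking_residual(mean_score, predicted=42.0)
```

Here the class mean is 45 rather than the 60 obtained by averaging only the three test takers, so keeping a weak student away from the exam can only hurt the teacher's ranking.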

All teachers who performed better than pre- 
dicted in both passing rate and average score 
were ranked from first to fourth place and
awarded points according to ranking. The
awards, based on total points, ranged from 6 
to 25 percent of the average annual income of 
high school teachers. A teacher could win sev- 
eral awards if she prepared more than one 
class for a matriculation examination. 56 Of the 
629 teachers in the program, 302 won awards. 

My analysis of the program found that it sig-
nificantly improved matriculation examina-
tion participation rates as well as the passing
rate and average test scores among those who
took the test. 57 These gains accounted for
about half of the improved outcomes among 
all students. They appear to have resulted 
from changes in teaching methods, after- 
school teaching, and increased responsive- 
ness to students’ needs, not from artificial in- 
flation or manipulation of test scores. The 
evidence that the incentive program im- 
proved teacher effort and pedagogy is impor- 
tant in the context of concerns about the pro- 
grams’ unintended effects, such as teaching 
to the test or cheating and the manipulation 
of test scores, and fears that such programs 
do not produce real learning. 

As a second example, in 1999 the United 
Kingdom introduced a systemwide perform-
ance-related pay policy for teachers using
student progress (value added) as one key 
criterion. 58 Using long-term teacher data and 
a before-and-after comparison research de- 
sign, one study evaluated the policy’s effect 
on test scores on the important GCSE (Gen- 
eral Certificate of Secondary Education) 
exams, taken by students at age sixteen at the 
end of compulsory education. 59 Because the 
incentive scheme was explicitly teacher- 
rather than school-based, the study followed 
teachers over two complete two-year teach- 
ing cycles before and after the policy was in- 
troduced. Students were linked to teachers 
who taught them specific subjects, making it 
possible to compare treatment and control 
group teachers. This panel data structure also 
made it possible to control for student and 
teacher unchanged characteristics (by includ- 
ing respective fixed effects) and to measure 
the scheme’s target: student progress. 

The study reported statistically and econom- 
ically significant student progress. For in- 
stance, relative to control teachers, treat- 
ment teachers increased their value added 
by almost half a GCSE grade per student, 
equal to 0.73 standard deviation. Also signif- 
icant were differences between school sub- 
jects, with treatment math teachers showing 
no effect from participating in the program. 
Although promising, again, the study is not 
definitive. One concern is that it applied 
only to teachers who had been in the profes- 
sion for about eight years, so that treatment 
and control teachers differed systematically 
in experience. If teachers improve in their 
capacity to generate value added as they gain 
more experience but at a decreasing rate, 
then, all else being equal, one can expect to 
see greater improvements in progress be-
tween the two teaching cycles for the less ex-
perienced (control) group. Taking teacher
experience into account in the analysis may
not solve the problem if the relationship be-
tween teachers’ experience and productivity
is nonlinear. 
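
The experience concern can be made concrete with a small numerical sketch. The logarithmic curve, the two-year gap between cycles, and the four-year control teacher are all assumptions chosen for illustration; only the figure of about eight years for treatment teachers comes from the study as described.

```python
import math


def value_added(years_experience):
    """Hypothetical concave curve: value added grows with experience
    but at a decreasing rate."""
    return math.log1p(years_experience)


def gain_between_cycles(years_at_first_cycle, years_between=2):
    """Improvement from experience alone between two teaching cycles."""
    later = value_added(years_at_first_cycle + years_between)
    return later - value_added(years_at_first_cycle)


# Control teacher with 4 years of experience, treatment teacher with 8.
control_gain = gain_between_cycles(4)
treatment_gain = gain_between_cycles(8)
```

Under this assumed curve the less experienced control teacher improves more between cycles with no incentive at all, so a before-and-after comparison would tend to understate the incentive effect, and a control that treats experience as linear would not remove the gap.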

Summary 

In general, the evidence suggests that well- 
designed individual-based incentives can sig- 
nificantly improve student outcomes. But the 
research base is small, and implementing 
purely individual-based programs presents 
many challenges. 

Policy Implications of the 
Evidence on Teachers’ Incentives 

Research evidence suggests, although not
conclusively, that pay-for-performance incen- 
tives can improve teachers’ performance. Yet 
these incentives can also lead to unintended 
and undesired consequences growing out of 
their inherent structural challenges, such as 
measurement issues or the possibility of 
teachers’ directing their efforts exclusively to 
rewarded activities. Although there are no 
magical cures for any of these problems, it is 
possible to draw lessons from the experience 
and evaluation of the better performance- 
related pay programs. In this section I offer 
some general guidelines for designing effec- 
tive programs. 

Which Outcomes to Reward 

Teachers should be evaluated on the basis of 
their true performance, not random variation 
in performance. Therefore, the performance 
measures chosen should exhibit relatively lit- 
tle random variation. Because teachers are 
expected to direct their efforts according to 
the incentives provided, the performance 
measures should cover all outcomes of inter- 
est, including quality and quantity (relevant 
examples are the two outcome measures 
used in the Israeli pay programs — passing 
rate and average score). Outcome measures 
should be as close as possible to the ultimate
outcomes, difficult to game, and easy to mon-
itor regularly.

Monitoring 

Close supervision and monitoring are needed 
to minimize gaming and shading the truth, if 
not outright fraud, in measured output. Be- 
cause monitoring can be costly, an appropri- 
ate strategy should balance the trade-off be- 
tween penalties and probability of detection. 
The matriculation examination system in Is- 
rael minimizes gaming: students are tested 
twice — in a school and a state examination — 
in each subject, and their final score is a 
weighted average of the two. Large gaps be- 
tween the scores invite penalties. Nonran- 
dom gaps between the two scores are audited 
at the class and school levels. Obviously, even 
the best-designed measurement and moni- 
toring system may lead to inappropriate real- 
location of effort because no single system is 
likely to capture all important behavior. One 
way to avoid this constriction in effort is to in- 
clude two outcomes, for instance, test scores 
and processes (for example, attention di- 
rected explicitly to disadvantaged students) 
and teacher practices (for example, teacher 
absenteeism). 
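
The two-examination check described in this section can be sketched as follows. The equal weights and the ten-point audit threshold are assumptions, since the text gives neither the actual weights nor the rule used to decide which gaps are audited.

```python
from statistics import mean


def final_score(school_score, state_score, school_weight=0.5):
    """Weighted average of the school and state examination scores."""
    if not 0.0 <= school_weight <= 1.0:
        raise ValueError("school_weight must lie between 0 and 1")
    return school_weight * school_score + (1.0 - school_weight) * state_score


def class_mean_gap(score_pairs):
    """Average of (school score - state score) across a class."""
    return mean(school - state for school, state in score_pairs)


def needs_audit(score_pairs, threshold=10.0):
    """Flag a class whose school scores differ systematically from
    its state scores by more than the threshold."""
    return abs(class_mean_gap(score_pairs)) > threshold


# Hypothetical classes, each a list of (school score, state score) pairs.
inflated_class = [(90, 70), (85, 68), (88, 72)]
ordinary_class = [(80, 78), (70, 74), (65, 64)]
flagged = needs_audit(inflated_class)
not_flagged = needs_audit(ordinary_class)
```

Averaging the gap over a class separates systematic inflation from ordinary student-level noise, which tends to cancel out.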

Reward Criteria 

Teachers can be compared according to stu- 
dent performance levels or to some form of 
gain in performance. Value-added measures, 
in particular, are more appropriate for meas- 
uring individual teacher effectiveness and en- 
suring fair ranking. Such methods can sub- 
stantially affect school or teacher rankings, in 
addition to the behaviors engaged in to game 
the system. Performance levels create incen- 
tives to pay greatest attention to highest- 
performing students; value-added criteria may 
increase attention to lowest-performing stu- 
dents. It is thus important to use performance 
measures that can be adjusted for contribut-
ing factors, such as student socioeconomic
characteristics, but not at the cost of complex 
value-added measures that lack transparency 
and are difficult to understand and accept. In 
addition, goals should be attainable and not be 
perceived as too ambitious. 

Rewarding Both Short- and 
Long-Term Measures 

Performance-related pay programs typically 
focus on key short-term schooling objec- 
tives — such as increasing the number of stu- 
dents reading at a given grade level — rather 
than on long-term objectives related to post- 
secondary education and labor market out- 
comes. This emphasis is due in large part to 
the difficulty of measuring long-term objec- 
tives, as well as to an almost universal belief 
that achieving short-term goals is a necessary 
condition for promoting long-term goals. But 
medium-term outcomes, such as dropout 
rates, can be affected by performance-related 
pay. The team-based program developed in 
Israel demonstrates how a mix of short- and 
medium-term goals can be achieved simulta- 
neously in an effective performance-related 
pay program. 

Subjective and Objective Evaluation 
Subjective evaluation invites the problem of 
performance measure inflation (namely, 
that teachers under the pressure of incen- 
tives tend to give higher than warranted 
scores), while objective metrics invite the 
problems of measurement cost and the in- 
ability to encompass all school targets and 
goals. Given these limitations, it may be ap- 
propriate to use both — each imperfect but 
still informative. For example, selective em- 
phasis can be assigned to each subjective 
and objective evaluation criterion, depend- 
ing on subject taught or specific task, and 
associated with individual, team, and school 
performance. The evaluation process and its
results should be transparent to all parties
involved. 

School and Individual Incentives 

Because some degree of teamwork character- 
izes all schools, incentives should balance indi- 
vidual rewards with school incentives. Design 
of these incentives should foster a cooperative 
culture, but not at the cost of an aggravated 
free-riding problem, a condition likely to arise 
when only group incentives are used. 

Monetary and Other Incentives 
Money is not alone in motivating perform- 
ance and promotions. Improved working 
conditions and increased decisionmaking au- 
thority may also be attractive to teachers, es- 
pecially when combined with bonuses or 
salary increases. The Israeli school incentive 
program included modest monetary bonuses 
but large rewards in the form of media atten- 
tion and enhanced reputation for the winning 
schools. Recognition and prestige are, it ap- 
pears, highly effective motivators. 

Eligibility and Size of Awards 

All teachers should be eligible for the incen- 
tive offered. Only when the majority of 
teachers are eligible to enjoy the benefits of 
hard work and improved outcomes will the 
incentive scheme be effective. However, only 
a subset of teachers should be rewarded in 
practice. If too many teachers are rewarded, 
teachers may not need to exert much effort 
to benefit. 

Flexibility 

A performance-related pay program can mo- 
tivate teachers and schools even more effec- 
tively if it also allows teachers to make deci- 
sions regarding instruction, curriculum, and 
other aspects of schooling that contribute to 
attaining the desired outcomes and goals. 
Creativity flourishes in the absence of bu-
reaucratic constraints. The search for ways to
improve teaching technology arises both be- 
cause teachers and schools have a direct 
stake in the matter and because merit pay 
plans reduce the need for regulation of in- 
puts and thus give teachers and schools more 
freedom. It is therefore essential that per- 
formance-related pay plans be accompanied 
by the increased flexibility required for their 
success. 

Conclusions 

Recent reforms in public education have in 
common a heightened stress on effective- 
ness — setting standards, measuring progress 
toward those standards, and imposing re- 
wards or penalties for meeting or failing to 
meet them. Performance-related pay systems 
use monetary rewards to meet these goals. In 
most cases these systems involve marginal 
changes rather than a complete revamping of 
the educational system. Research evidence 
suggests, though not conclusively, that incen- 
tive-based compensation can generate gains 
in student performance and teacher effec- 
tiveness. Teachers and schools appear to re- 
spond to monetary incentives by exerting 
more effort, applying more creativity, and 
modifying their pedagogical practices. Mea- 
suring and rewarding performance are thus 
potentially important elements in modern 
public school compensation systems. They 
can also, theoretically, discourage poor per- 
formers and attract better performers in the 
medium and longer terms. 

It is telling that private schools typically have 
more flexible pay structures than public 
schools and that they are more likely to use 
variations of performance-related pay. Dale 
Ballou reports that in 1993, 12 percent of 
U.S. public school districts had merit pay 
plans, as compared with 35 percent of non- 
sectarian private schools. 60 Merit pay can
clearly be integrated and implemented in the
private schools. Political or technical con- 
straints may be impeding its success in public 
schools. 

Implementing performance-based pay re- 
quires meeting the daunting challenge of de- 
vising a system for measuring performance. 
The system must measure true performance 
in a way that minimizes random variation, as 
well as undesired and unintended conse- 
quences. It must align performance with ulti- 
mate outcomes and monitor performance to 
discourage cheating. Clearly, setting up an ef- 
fective performance-related pay system is not 
a one-time task, but an ongoing effort. Even 
with the best preparation, initial implementa- 
tion is likely to be problematic. Measure- 
ments will have a random component. Teach- 
ers will find ways to game the system. Any 
initial system will almost certainly be flawed. 
But if the effort is seen as an ongoing one, it 
should be possible to make progress gradu- 
ally in addressing each of the challenges. For 
example, better measurement instruments
should shrink the random component in the
measures, measurements can be adjusted to
prevent gaming, and careful monitoring can de-
tect and therefore deter cheating. Eventually
incentives will be developed that motivate 
the desired teaching behaviors and will be 
perceived by teachers as fair and accurate. 

Although the suggestions above are consis- 
tent with most of the limited empirical evi- 
dence available on the effectiveness of per- 
formance-related pay strategies, researchers 
have much work to do. Their efforts are im- 
peded both by the reluctance of school sys- 
tems to conduct careful evaluations and by a 
lack of appropriate outcome measures and 
comparison groups. Starting now, researchers 
should follow closely all efforts to implement 
performance-based pay in public schools, 
tracking carefully the behavioral changes in- 
duced and the ultimate outcomes of mone- 
tary rewards given to school staff. Meeting 
these research challenges is critical to the de-
sign of effective performance-related pay sys-
tems in the future.